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The HMW1/HMW2 Two-Partner 
Secretion Systems Have a Third 
Partner 

The HMWl and HMW2 adhesins of 
nontypeable Haemophilus influenzae are 
high-molecular weight proteins that are 
secreted by the two-partner secretion 
(TPS) pathway, also known as the Type 
Vb secretion pathway [1,2]. TPS systems 
typically consist of a large extracellular 
protein called a TpsA protein (encoded by 
a tpsA gene) and a cognate outer mem- 
brane pore-forming translocator protein 
called a TpsB protein (encoded by a tpsB 
gene). HMWl and HMW2 are TpsA 
proteins and are encoded by hmwlA and 
hmw2A, respectively, and HMWIB and 
HMW2B are the cognate TpsB proteins 
and are encoded by hmwlB and hmw2B, 
respectively [3,4]. The hmwlA-hmwlB and 
hmw2A-hmw2B gene clusters have a similar 
configuration and are located in physically 
separate regions of the H. influenzae 
chromosome. 

A distinctive feature of tlie HMWl and 
HMW2 systems is the presence of a tliird 
protein, called HMWIC in the HMWl 
system and HMW2C in tlie HMW2 system. 
HMWIC and HMW2C are highly homolo- 
gous glycosyltransferases [5,6] that are re- 
sponsible for adding sugar moieties to HMWl 
and HMW2 and are encoded by the hmwlC 
and hmw2C genes, located downstream of 
hmwlB and hmw2B, respectively. Since the 
HMWl and HMW2 systems have similar 
properties [7], in this review we will confine 
our discussion to the HMWl system. 

The HMW 1 adhesin is presented on the 
bacterial surface via a multistep process 
that requires HMWlC-mediated glycosyl- 
ation (reviewed in [8]). As shown sche- 
matically in Figure 1, HMWl is synthe- 
sized and glycosylated in the cytoplasm 
and is directed to the Sec translocase in 
the inner membrane via an extended N- 
terminal signal sequence [9]. The signal 
sequence is cleaved by signal peptidase I, 
and nascent HMWl is then directed to its 
cognate HMWIB fi-barrel pore in the 
outer membrane [9]. The initial interac- 
tion between HMWl and HMWIB oc- 



curs via the N-terminal TPS secretion 
domain in the HMWl pro-piece and the 
periplasmic domain in HMWIB [10]. The 
HMWl pro-piece spans amino acids 69- 
441 and is cleaved during or following 
secretion through the HMWIB pore [9]. 
HM\\'l is ultimately tethered to the 
bacterial surface via a noncovalent inter- 
action that requires the C-terminal 20 
amino acids of the protein and is depen- 
dent upon disulfide bond formation be- 
tween two conserved cysteine residues in 
this region (cysteines 1518 and 1528). 
Immunolabeling studies have demonstrat- 
ed that the immediate C terminus of 
HMWl is inaccessible to surface labeling, 
suggesting that it remains in the periplasm 
or is buried in the HMWIB pore [5,11]. 
Elimination of HMW 1 C results in degra- 
dation of HMWl in bacterial lysates, 
indicating that glycosylation is required 
for HMWl stability. Any remaining non- 
glycosylated HMWl is released into the 
culture supernatant, indicating that glyco- 
sylation is also required for HMWl 
tethering to the bacterial surface [5]. 

Manual analysis of mass spectra of 
HMWl was required to recognize that 
glycan structures are present at asparagine 
residues in conserved NXS/T motifs, 
reflecting the fact that the modifying 
carbohydrates are mono-hexose or di- 
hexose groups rather than complex poly- 
saccharides [12,13]. There are at least 31 
residues that are modified with glucose, 
galactose, glucose-glucose, or glucose-ga- 
lactose residues in the mature surface- 
localized HMWl protein [12,13]. Based 
on biochemical analysis and examination 



of the crystal structure of the HMW 1 pro- 
piece, the pro-piece is nonglycosylated, 
perhaps because glycosylation would in- 
terfere with cleavage of this fragment, 
which occurs by an undefined mechanism 
(Figure 1) [9,12,14]. 

HMW1C is the Prototype 
Member of a New Subfamily of 
Glycosyltransferases 

Protein glycosylation occurs in all king- 
doms of life and is thought to influence 
protein folding, stability, and function 
[15,16]. Some bacteria produce complex 
0-linked or jV-linked glycosyltransferase 
systems. These systems have been studied 
in pathogenic bacteria and glycosylate 
proteins that are typically surface exposed, 
suggesting a role for glycosylation in 
bacteria-host interactions [17]. However, 
none of the previously studied bacterial 
glycosyltransferase pathways operates like 
HMWIC, which is capable by itself of 
forming both jV-Unked carbohydrate bonds 
to the HMWl polypeptide and 0-glycosid- 
ic bonds between hexose sugars [12,18,19]. 

Based on homology analysis and mo- 
lecular modeling, HMW 1 C belongs to the 
GT41 family of glycosyltransferases, a 
family that otherwise contains 0-GlcNAc 
transferases. HMWIC consists of three 
discrete domains, including an ot-helical 
AAD domain at the N terminus and two 
Rossman-like domains that create a GT-B 
fold at the C terminus. Interestingly, the 
AAD fold in HMWIC differs from the so- 
called tetratricopeptide repeats (TPR) fold 
that is characteristic of the GT41 family. 
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Figure 1. Glycosylation by HMW1C may play several roles In promoting HMW1 
stability, export, folding, and function. HMW1C has the potential to contribute to several 
different processes that occur during HMW1 synthesis and transit across the inner and outer 
membranes. First of all, HMW1C glycosylates the HMWl adhesin in the cytoplasm and is likely to 
be involved in the stability of the HMWl adhesin during or after its synthesis. Glycosylation may 
contribute to stability of HIV1W1 in the (A) cytoplasm or (C) periplasm [5,7]. Alternatively, the 
HIVIW1 C protein may improve stability of HIVIW1 by acting as a (B) chaperone prior to secretion of 
the adhesin. It is unlikely that the activity of HIV1W1C is required for export of the adhesin across 
either the inner or outer membrane, as fully processed HMWl is found in the supernatant in the 
absence of HMW1C [5]. It is unclear whether glycosylation influences interaction of HMWl with 
the (D) HMWl B periplasmic domain prior to transit, (E) the HMWl B pore during transit, or (F) the 
docking region of HMWl B upon surface tethering [5,1 1 ]. It is also unclear whether glycosylation 
participates in (G) protein folding upon export. Evidence from the nonglycosylated Bordetella 
prototypic, two-partner, secreted adhesin FHA indicates that this adhesin remains unfolded in the 
cytoplasm and folds very rapidly upon export via its TpsB secretion pore [30]. One hypothesis is 
that the energy generated by this rapid folding is at least part of what drives export of TpsA 
proteins across the outer membrane [2]. Finally, glycosylation of HMWl may be required for (H) 
adherence to host cells or host interaction in a particular niche. 
doi:1 0.1 371/journal.ppat.1 003977.g001 



and the contacts between tlie AAD 
domain and tlie GT-B domain in 
HMWIG create a unique groove that is 
absent in other linown members of the 
GT41 family. Thus, the HMWIG protein 
represents a novel glycosyltransferase sub- 
family [13]. 

Among the best-described bacterial jV- 
linked glycosyltransferase systems is the 



Pgl system in Campylobacter jejuni [20-22]. 
While both the Pgl system and HVIWIG 
affrx sugars to their target proteins at 
asparagines, the similarities end there. 
First, the Pgl system consists of at least 
ten proteins encoded by a gene cluster 
[23], rather than a single protein like 
HIVIWIG. Second, the Pgl enzymes are 
active in the periplasm, while HMWIG 



acts in the cytoplasm [5]. Third, the Pgl 
enzymes add a heptasaccharide that is 
most likely formed in the cytoplasm on a 
lipid carrier that is then flipped into the 
periplasm. In contrast, HMWIG adds 
single UDP-hnked sugars to HMWl 
without the contribution of a lipid carrier 
[12]. 

The HMW1C-Like 
Glycosyltransferases Segregate 
into Two Subsets 

Based on homology analysis of predict- 
ed amino acid sequences, HMWlG-like 
proteins are prevalent among bacteria in 
the Pastuereallaceae, Enterobacteriaceae, 
Neisseriaceae, and Burkholderiaceae fam- 
ilies and appear to segregate into two 
categories [24] . The first category contains 
enzymes encoded by genes adjacent to 
predicted tpsA and tpsB genes in apparent 
TPS systems (Figure 2A). Examples of 
HMWIG -like proteins that fall into this 
category are EtpG in enterotoxigenic 
Escherichia coli (ETEG) [25], RscG in 
Yersinia enterocolitica [26], and predicted 
HMWIG -like glycosyltransferases in T. 
pestis, T. pseudotuberculosis, and Burkholderia 
spp. Among these proteins, only EtpG has 
been demonstrated to possess glycosyl- 
transferase activity, adding sugar residues 
to the target EtpA adhesin [25]. In fact, 
glycosylation of EtpA appears to affect 
adhesin interaction with host cells, as 
nonglycosylated EtpA is less adherent to 
Gaco-2 intestinal epithelial cells but hy- 
peradherent to HGT-8 intestinal cells 
when compared to glycosylated EtpA 
[25]. By analogy to the HMWl system 
and the Etp system, we hypothesize that 
the HMWlG-like enzymes in this category 
modify the co-produced TpsA protein. 

The second category of HMWlG-like 
proteins contains enzymes that are encod- 
ed by genes adjacent to genes encoding 
ribosomal proteins and other transferase 
enzymes involved in carbohydrate modifi- 
cation. Members of this category have no 
obvious predicted protein target for glyco- 
sylation and include the crystallized Acti- 
nobacillus pleuropneumoniae HlMWlG-like 
protein (ApHMWlG) and predicted 
HIVlWlG-like proteins in H. ducreyi and 
Kingella kingae (Figure 2B). Of the proteins 
in this category, only ApHMWlG has 
been characterized and is known to 
possess glycotransferase activity 
[18,19,27]. Recent work has demonstrated 
that the ApHMWlG glycosyltransferase 
preferentially decorates predicted auto- 
transporter proteins as well as other outer 
membrane proteins when expressed in 
vivo and in E. coli [27], suggesting that 
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Figure 2. HMW1C-like proteins in two categories: Those encoded by loci that contain obvious substrate genes and those encoded 
by isolated genes without adjacent substrate genes. Numbers below hmwlC-Wke genes represent translated protein sequence percent 
identity/similarity when compared to H. influenzae HMWIC. (A) HMWIC-like enzymes encoded in apparent TPS systems. (B) HMWIC-like enzymes 
encoded in loci without obvious surface protein targets for glycosylation. Abbreviations: Hi, H. influenzae 86-028NP; Bsp, Burkholderia species 
GCE1003; Ec, Enterotoxigenic E. coll HI 0407; Yp, Y. pseudotuberculosis YPIII; Ap, Actinobacillus pleuropneumoniae L20; Hd, H. ducreyi HD35000; Kk, K. 
kingae 269-492; hyp, hypothetical with no conserved domains; /lyp', predicted lipoprotein; hyp , predicted UDP-glcNAc carboxyvinyltransferase; hyp^, 
predicted 2 C-methyl-D erythritol-4-phosphate cytidyltransferase; hyp'*, predicted deoxyguanosinetriphosphate triphosphohydrolase. 
doi:1 0.1 371 /journal.ppat.1 003977.g002 



enzymes in this category may generally 
have a preference for glycosylation of 
outer membrane proteins. It is interesting 
to note that the HMWIC-like glycosyl- 
transferases in this second category are 
more homologous to the prototype 
HMWIC protein than are the HMWIC- 
like proteins encoded by TPS loci. 

HMWIC-Like Enzymes Have a 
Weil-Defined Structure 
Involved In Binding and 
Transferring UDP-Hexoses 

The crystal structure of ApHMWlC 
provides some clues as to how HMWIC- 
like glycosyltransferases are able to deco- 
rate asparagines. When overlaid using 
structural prediction algorithms, HMW 1 C 
and ApHMWlC are nearly identical, 
differing only by a disordered 30-amino 
acid N-terminal tail that is present in 
HMWIC and absent in ApHMWlC [19]. 
Given this level of identity, predictions 
about structure-function relationships in 
ApHMWlC are likely to apply to H. 
influenzae HMWIC and potentially other 
HMWIC-like proteins. Consistent with 
this conclusion, mutation of amino acids 
in HMWIC corresponding to amino acids 
that line the likely UDP-hexose binding 
pocket in ApHMWlC eliminates the 
ability of HMWIC to glycosylate HMWl 
or to produce functional HMWl when 
expressed in whole bacteria [19]. Exami- 



nation of the ApHMWlC structure indi- 
cates a funnel-shaped groove immediately 
adjacent to the predicted UDP-hexose 
binding pocket. When key residues within 
this groove are mutated in HMWIC, 
glycosylation of HMWl is eliminated, 
indicating that the groove plays an impor- 
tant role in acceptor protein modification 
[19]. A complete understanding of the 
specificity and limitations of the HMWIC- 
like enzyme family may help in an 
industrial setting, as a single enzyme able 
to catalyze the first step of jV-linked protein 
glycosylation without a lipid carrier has 
potential to become a workhorse for in 
vitro protein production [28]. 

What Do We Still Want to Learn 
About HMW1C-Like 
Glycosyltransferases? 

Despite the relative simplicity of 
HMWIC -mediated glycosylation, there is 
stiU much to learn about how the HMWIC 
prototype operates and even more to learn 
about the function and targets of HMWIC 
homologs in other medically relevant 
bacteria. First, we do not understand how 
HMWIC recognizes its target and chooses 
specific glycosylation sites, as only a subset 
of NXS/T motifs in HMWl are decorated 
based on mass spectrometry analysis. A 
better understanding of HMWIC target 
recognition may assist in development of in 
vitro glycosylation systems for protein 



manufacture and also increase understand- 
ing of protein-protein interactions in bac- 
teria. Second, while it is clear that 
HMWIC interacts directly with HMWl 
[5] , the number of HMWl C molecules that 
bind to HMWl at any given time and the 
duration of this interaction in the cytoplasm 
are still unknown. This information may 
help to clarify whether HMWIC has both 
glycosyltransferase activity and chaperone 
activity to stave off degradation of HMWl 
(Figure 1). Third, the glycosylation targets 
in bacteria that have no co-transcribed tpsA 
gene alongside the gene that codes for the 
HMWIC-like enzyme remain to be deter- 
mined, although progress has been made in 
identifying potential targets of ApHMWlC 
[27]. Knowledge of these targets wiU help 
to clarify whether there are consensus 
glycosylation target sequences among 
HMWIC-like enzymes and whether the 
cellular location of glycosylation is con- 
served. Fourth, while it is clear that 
glycosylation is required for HMWl teth- 
ering to the bacterial surface, the mecha- 
nism involved is unknown. Finally, it will 
also be important to elucidate how 
HAFWlC-mediated glycosylation of target 
proteins affects bacterial pathogenesis, ex- 
panding the limited literature on the role 
that HMWlC-mediated glycosylation may 
play in bacteria-host interactions 
[19,25,26,29]. 

The HMWIC-like glycosyltransferases 
have novel features, and advances in our 
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understanding of this interesting group of 
enzymes will likely have important impli- 
cations for the field of glycobiology and 
the field of bacterial pathogenesis. 
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